Model Formulation: Evaluating Predictors of Geographic Area Population Size Cut-offs to Manage Re-identification Risk
Abstract
OBJECTIVE: In public health and health services research, the inclusion of geographic information in data sets is critical. Because of concerns over the re-identification of patients, data from small geographic areas are either suppressed or the geographic areas are aggregated into larger ones. Our objective is to estimate the population size cut-off at which a geographic area is sufficiently large so that no data suppression or further aggregation is necessary.
DESIGN: The 2001 Canadian census data were used to conduct a simulation to model the relationship between geographic area population size and uniqueness for some common demographic variables. Cut-offs were computed for geographic area population size, and prediction models were developed to estimate the appropriate cut-offs.
MEASUREMENTS: Re-identification risk was measured using uniqueness. Geographic area population size cut-offs were estimated using the maximum number of possible values in the data set and a traditional entropy measure.
RESULTS: The model that predicted population cut-offs using the maximum number of possible values in the data set had R² values around 0.9, and relative error of prediction less than 0.02 across all regions of Canada. The models were then applied to assess the appropriate geographic area size for the prescription records provided by retail and hospital pharmacies to commercial research and analysis firms.
CONCLUSIONS: To manage re-identification risk, the prediction models can be used by public health professionals, health researchers, and research ethics boards to decide when the geographic area population size is sufficiently large.
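The quantities named in the abstract can be illustrated with a minimal sketch. It rests on several assumptions that the abstract does not state. Uniqueness is taken as the fraction of records whose combination of demographic values occurs exactly once in an area. The "maximum number of possible values" is taken as the product of the number of categories of each variable. The entropy measure is taken as Shannon entropy over observed combinations. The cut-off rule, the threshold, and all function names and sample data are hypothetical, and this is not the authors' simulation or prediction model.

```python
from collections import Counter
from math import log2, prod


def uniqueness(records):
    """Fraction of records whose value combination occurs exactly once (assumed definition)."""
    if not records:
        return 0.0
    counts = Counter(records)
    unique = sum(1 for c in counts.values() if c == 1)
    return unique / len(records)


def max_combinations(domain_sizes):
    """Assumed reading of 'maximum number of possible values': product of category counts."""
    return prod(domain_sizes)


def entropy(records):
    """Shannon entropy (in bits) of the observed value combinations."""
    counts = Counter(records)
    n = len(records)
    return -sum((c / n) * log2(c / n) for c in counts.values())


def population_cutoff(areas, threshold):
    """Hypothetical cut-off rule.

    areas: list of (population_size, records) pairs.
    Returns the smallest population size such that every area at least
    that large has uniqueness <= threshold, or None if even the largest
    area exceeds the threshold.
    """
    cutoff = None
    for size, records in sorted(areas, key=lambda a: a[0], reverse=True):
        if uniqueness(records) > threshold:
            break
        cutoff = size
    return cutoff


# Toy data: (sex, age band) pairs for three made-up areas.
area_a = [("F", "30-39"), ("F", "30-39"), ("M", "40-49"), ("M", "50-59")]
area_b = [("F", "30-39"), ("M", "40-49"), ("M", "50-59")]
area_c = [("F", "30-39"), ("F", "30-39")]
areas = [(4, area_a), (3, area_b), (2, area_c)]
```

With these toy areas, `uniqueness(area_a)` is 0.5 because two of the four records have a combination shared by no one else. A real analysis would draw records from census microdata and would choose the uniqueness threshold as a matter of policy.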
Related articles
A Logistic Regression Analysis of Predictors for Asthma Hospital Re-admissions
In order to identify the risk factors (predictors) of re-hospitalisation for high-risk asthmatic patients, a retrospective logistic regression analysis describing the relationship between the probability of re-admission and possible predictors in hospitalised asthmatics, aged over 5 years, between 1994-1998, was designed. Study setting was a district general hospital in West Yorkshire, UK. ...
Metabolic syndrome and its predictors in an urban population in Kenya: A cross sectional study
BACKGROUND The metabolic syndrome (MetS) is a clustering of interrelated risk factors which doubles the risk of cardio-vascular disease (CVD) in 5-10 years and increases the risk of type 2 diabetes 5 fold. The identification of modifiable CVD risk factors and predictors of MetS in an otherwise healthy population is necessary in order to identify individuals who may benefit from early interventi...
Defining Pathways and Trade-offs Toward Universal Health Coverage; Comment on “Ethical Perspective: Five Unacceptable Trade-offs on the Path to Universal Health Coverage”
The World Health Organization’s (WHO’s) World Health Report 2010, “Health systems financing, the path to universal coverage,” promoted universal health coverage (UHC) as an aspirational objective for country health systems. Yet, in addition to the dimensions of services and coverage, distribution of coverage in the population, and financial risk protection highlighted by the report, the conside...
The impact of Economic and Geographic indicators in Trade in OIC Countries (Using Gravity Model)
The present paper attempts to estimate the impact of economic and geographic indicators on trade among Islamic countries using a bilateral trade model, the gravity model, and thereby to study the relationship between economy, geography, and trade. A fixed-effects panel data estimation procedure is applied to OIC member country data spanning 2007–2012. The result of this researc...
Journal title:
- Journal of the American Medical Informatics Association : JAMIA
Volume 16, Issue 2
Pages: -
Publication date: 2009